Uma Ferramenta para Identificar Desvios de Linguagem na Língua Portuguesa (A tool to identify the linguistic deviations in the Portuguese Language)[In Portuguese]

نویسندگان

Jonathan Nau

Aluízio Haendchen Filho

Guilherme Passero

Vinicius Cavaco

چکیده

Abstract. The revision of formal texts is a complex task and occurs in several areas. The objective of this work is to create a tool to support the revision of texts and promote studies in automatic correction of descriptive texts. We propose a reviewer for automatic identification of language deviations in formal descriptive texts using natural language processing techniques. A case study was carried out to evaluate the proposed approach in a public set of essays. The tool identified 3,255 deviations in a universe of 762 essays.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Geração de features para resolução de correferência: Pessoa, Local e Organização (Feature Generation for Coreference Resolution: Person, Location and Organization) [in Portuguese]

This work aims at resolving coreference in Portuguese, focusing on categories of named entities Person, Location and Organization. The proposed method uses supervised learning. To this end, the use of features that assist in the correct classification of named entities is critical. The construction and refinement of these features are of great relevance to his task. The performance of many othe...

متن کامل

RePort - Um Sistema de Extração de Informações Aberta para Língua Portuguesa (Report - An Open Information Extraction System for Portuguese Language)

An emerging field of research in Natural Language Processing (NLP) proposes Open Information Extraction systems (Open IE). Open IEs follow a domain-independent extraction paradigm that uses generic patterns to extract all relationships between entities. In this work, we present RePort, a method of Open IE for Portuguese, based on the ReVerb, an approach for English. Adaptations of syntactic and...

متن کامل

Identificação de Autoria de Textos através do uso de Classes Linguísticas da Língua Portuguesa (Authorship Identification Using Linguistic Classes for Portuguese) [in Portuguese]

The computational solution uses to solve problems related to the authorship identification and verification has grown progressively in areas such as computing, linguistics and law. This article aims to provide a method for the identification of authors ot text, based on a conjunct of attributes stilometry, using on the characteristics of Portuguese language. Resumo. A utilização do meio computa...

متن کامل

Criando um corpus sobre desastres climáticos com apoio da ferramenta NLTK (Creating a Corpus about Climate Disasters with the Support of the NLTK Tool) [in Portuguese]

This work is part of a broader research that explores information from a corpus of news about climate disasters and automatically recognizes, with the support of a tool for Natural Language Processing (NLP), words that denote the main actors involved and their actions in providing relief to victims. It starts with the hypothesis of Steinberger [2005] that news reports of disasters not only allo...

متن کامل

Análise Automática de Coerência Textual em Resumos Científicos: Avaliando Quebras de Linearidade (Automatic Analysis of Textual Coherence in Scientific Abstracts: Evaluating Linearity Breaks)

This paper presents an extension of the coherence analysis module that is part of the writing tool called SciPo, allowing it to automate the analysis of the coherence dimension called Linearity Break. The proposed implementation is based on a combination of the entity grid model and information from the rhetorical structure of scientific abstracts, allowing it to generate messages that indicate...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2017

Uma Ferramenta para Identificar Desvios de Linguagem na Língua Portuguesa (A tool to identify the linguistic deviations in the Portuguese Language)[In Portuguese]

نویسندگان

چکیده

منابع مشابه

Geração de features para resolução de correferência: Pessoa, Local e Organização (Feature Generation for Coreference Resolution: Person, Location and Organization) [in Portuguese]

RePort - Um Sistema de Extração de Informações Aberta para Língua Portuguesa (Report - An Open Information Extraction System for Portuguese Language)

Identificação de Autoria de Textos através do uso de Classes Linguísticas da Língua Portuguesa (Authorship Identification Using Linguistic Classes for Portuguese) [in Portuguese]

Criando um corpus sobre desastres climáticos com apoio da ferramenta NLTK (Creating a Corpus about Climate Disasters with the Support of the NLTK Tool) [in Portuguese]

Análise Automática de Coerência Textual em Resumos Científicos: Avaliando Quebras de Linearidade (Automatic Analysis of Textual Coherence in Scientific Abstracts: Evaluating Linearity Breaks)

عنوان ژورنال:

اشتراک گذاری